Inducing Fuzzy Decision Trees in Non-Deterministic Domains using CHAID

نویسندگان

  • Jay Fowdar
  • Zuhair Bandar
  • Keeley A. Crockett
چکیده

Most decision tree induction methods used for extracting knowledge in classification problems are unable to deal with uncertainties embedded within the data, associated with human thinking and perception. This paper describes the development of a novel tree induction algorithm which improves the classification accuracy of decision tree induction in non-deterministic domains. The research involved applies the principles of fuzzy theory to the CHAID (Chi-Square Automatic Interaction Detection) algorithm in order to soften the sharp decision boundaries which are inherent in traditional decision tree algorithms. CHAID is a decision tree induction algorithm with the main feature of significance testing at each level, leading to the production of trees which require no pruning. The application of fuzzy logic to CHAID decision trees can represent classification knowledge more naturally and inline with human thinking and are more robust when it comes to handling imprecise, missing or conflicting information. The results of applying fuzzy logic to CHAID induced decision trees are presented in this paper. These have been obtained from sets of real world data, and show that the new fuzzy inference algorithm improves the accuracy over crisp CHAID trees. The results show that the increase in performance is dependant upon the inference technique employed and the amount of fuzzification applied.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Algorithm for Optimization of Fuzzy Decision Tree in Data Mining

Decision-tree algorithms provide one of the most popular methodologies for symbolic knowledge acquisition. The resulting knowledge, a symbolic decision tree along with a simple inference mechanism, has been praised for comprehensibility. The most comprehensible decision trees have been designed for perfect symbolic data. Classical crisp decision trees (DT) are widely applied to classification t...

متن کامل

Prognostic factors in head and neck squamous cell carcinoma: comparison of CHAID decision trees technology and Cox analysis.

BACKGROUND The purpose of this study was to compare the risk factors obtained from a classical statistical method (Cox proportional hazards model) and the results obtained with classification trees (Chi-square Automatic Interaction Detection [CHAID] model) in head and neck squamous cell carcinoma (HNSCC). METHODS We conducted a retrospective study of 3373 patients with HNSCC and a follow-up l...

متن کامل

Application of Decision-Tree Model to Groundwater Productivity-Potential Mapping

For the sustainable use of groundwater, this study analyzed groundwater productivity-potential using a decision-tree approach in a geographic information system (GIS) in Boryeong and Pohang cities, Korea. The model was based on the relationship between groundwater-productivity data, including specific capacity (SPC), and its related hydrogeological factors. SPC data which is measured and calcul...

متن کامل

Detection of fraudulent financial statements using the hybrid data mining approach

The purpose of this study is to construct a valid and rigorous fraudulent financial statement detection model. The research objects are companies which experienced both fraudulent and non-fraudulent financial statements between the years 2002 and 2013. In the first stage, two decision tree algorithms, including the classification and regression trees (CART) and the Chi squared automatic interac...

متن کامل

Implementation of Classifiers for Choosing Insurance Policy Using Decision Trees: A Case Study

In this paper, we use decision trees to establish the decision models for insurance purchases. Five major types of insurances are involved in this study including life, annuity, health, accident, and investment-oriented insurances. Four decision tree methods were used to build the decision models including Chi-square Automatic Interaction Detector (CHAID), Exhaustive Chi-square Automatic Intera...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004